Skip to content

chore: Merge set of changes for release v3.1.0#630

Merged
dushyantbehl merged 30 commits into
releasefrom
v3.1.0-rc3
Nov 10, 2025
Merged

chore: Merge set of changes for release v3.1.0#630
dushyantbehl merged 30 commits into
releasefrom
v3.1.0-rc3

Conversation

@dushyantbehl
Copy link
Copy Markdown
Collaborator

Description of the change

Related issue number

How to verify the PR

Was the PR tested

  • I have added >=1 unit test(s) for every new method I have added.
  • I have ensured all unit tests pass

dushyantbehl and others added 29 commits July 24, 2025 21:28
Signed-off-by: Dushyant Behl <dushyantbehl@users.noreply.github.com>
Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>
…593)

Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>
…ncessary memory overhead (#592)

* Added use_cache in the model args

Signed-off-by: romit <romit@ibm.com>

* Removed model args use_cache

Signed-off-by: r0 <11757603+romitjain@users.noreply.github.com>
Signed-off-by: romit <romit@ibm.com>

* Update sft_trainer.py, default use_cache

Signed-off-by: r0 <11757603+romitjain@users.noreply.github.com>
Signed-off-by: romit <romit@ibm.com>

* Updated cache usage in vision models

Signed-off-by: romit <romit@ibm.com>

---------

Signed-off-by: romit <romit@ibm.com>
Signed-off-by: r0 <11757603+romitjain@users.noreply.github.com>
Co-authored-by: Dushyant Behl <dushyantbehl@users.noreply.github.com>
Signed-off-by: Padmanabha V Seshadri <seshapad@in.ibm.com>
Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>
Signed-off-by: yashasvi <yashasvi@ibm.com>
…603)

Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>
* restructure README

Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>

* split readme

Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>

* Update README.md

Co-authored-by: Praveen Jayachandran <praveenj83@users.noreply.github.com>
Signed-off-by: Dushyant Behl <dushyantbehl@users.noreply.github.com>

* update readme

Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>

* Update README.md

Co-authored-by: Praveen Jayachandran <praveenj83@users.noreply.github.com>
Signed-off-by: Dushyant Behl <dushyantbehl@users.noreply.github.com>
Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>

---------

Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>
Signed-off-by: Dushyant Behl <dushyantbehl@users.noreply.github.com>
Co-authored-by: Praveen Jayachandran <praveenj83@users.noreply.github.com>
Signed-off-by: yashasvi <yashasvi@ibm.com>
Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>
Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>
Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>
* Updated LoraConfig to subclass from peft.LoraConfig
Signed-off-by: romit <romit@ibm.com>

* Added some fields under custom dataclass to let is pass through HfArgumentParser
Signed-off-by: romit <romit@ibm.com>

* Lint and fmt fixes
Signed-off-by: romit <romit@ibm.com>

* Updated config utils

Signed-off-by: romit <romit@ibm.com>

* Update comment in LoraConfig

Signed-off-by: r0 <11757603+romitjain@users.noreply.github.com>

* Lint changes

Signed-off-by: romit <romit@ibm.com>

* Updated comment

Signed-off-by: romit <romit@ibm.com>

---------

Signed-off-by: romit <romit@ibm.com>
Signed-off-by: r0 <11757603+romitjain@users.noreply.github.com>
Co-authored-by: Dushyant Behl <dushyantbehl@users.noreply.github.com>
Signed-off-by: yashasvi <yashasvi@ibm.com>
Fix image in README.

Signed-off-by: Dushyant Behl <dushyantbehl@users.noreply.github.com>
* feat: add ckpt conversion script fp32-bf16

Signed-off-by: yashasvi <yashasvi@ibm.com>

* feat: add remove optim func to checkpoint_util.py

Signed-off-by: yashasvi <yashasvi@ibm.com>

* feat: add inplace file deletion capability

Signed-off-by: yashasvi <yashasvi@ibm.com>

---------

Signed-off-by: yashasvi <yashasvi@ibm.com>
feat: add odm plugin

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

---------

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
* feat: resume functionality

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* feat: resume functionality

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* feat: resume functionality

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* feat: resume functionality

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* feat: resume functionality

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* fix: refactor code

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* fix: refactor code

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* fix: refactor code

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

* fix: refactor sampling weight

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>

---------

Signed-off-by: Mehant Kammakomati <mehant.kammakomati2@ibm.com>
feat: Alora migration to PEFT upstream.

Signed-off-by: yashasvi <yashasvi@ibm.com>

---------

Signed-off-by: yashasvi <yashasvi@ibm.com>
Signed-off-by: yashasvi <yashasvi@ibm.com>
Signed-off-by: yashasvi <yashasvi@ibm.com>
Signed-off-by: yashasvi <yashasvi@ibm.com>
Signed-off-by: yashasvi <yashasvi@ibm.com>
Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>
* Add free up disk space to gh runners

Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>

* Build flash attn without cache to avoid problems

Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>

---------

Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>
@github-actions
Copy link
Copy Markdown

Thanks for making a pull request! 😃
One of the maintainers will review and advise on the next steps.

@github-actions github-actions Bot added the chore label Nov 10, 2025
Signed-off-by: Dushyant Behl <dushyantbehl@in.ibm.com>
@dushyantbehl dushyantbehl merged commit 8659901 into release Nov 10, 2025
10 of 11 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants